GCGGGGGGCAGGAAGGGGCGGGGGGCTCGGCGCACTCGGCAGGAAGAGACCGACCCGCC 

ACCCGCCGTAGCCCGCGCGCCCCTGGCACTCAATCCCCGCCATGTGGGGGCTCCTGCTCG

CCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCTCTGGGGGCGCCCAGGAACTCGGTGC 

TGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCAGGCTCGACCCCGGCCCTGCATAGCA 

GCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGGACCTCAGAACAGCATGTCCGGATT 

CGAGTCATCAAGAAGAAAAAGGTCATTATGAAGAAGCGGAAGAAGCTAACTCTAACTCG 

CCCCACCCCACTGGTGACTGCCGGGCCCCTTGTGACCCCCACTCCAGCAGGGACCCTCGA 

CCCCGCTGAGAAACAAGAAACAGGCTGTCCTCCTTTGGGTCTGGAGTCCCTGCGAGTTTC 

AGATAGCCGGCTTGAGGCATCCAGCAGCCAGTCCTTTGGTCTTGGACCACACCGAGGACG 

GCTCAACATTCAGTCAGGCCTGGAGGACGGCGATCTATATGATGGAGCCTGGTGTGCTGA 

GGAGCAGGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCACCCCACCCGCTTCTCGGG 

TGTTATCACACAGGGCAGGAACTCTGTCTGGAGGTATGACTGGGTCACATCATACAAGGT 

CCAGTTCAGCAATGACAGTCGGACCTGGTGGGGAAGTAGGAACCACAGCAGTGGGATGG 

ACGCAGTATTTCCTGCCAATTCAGACCCAGAAACTCCAGTGCTGAACCTCCTGCCGGAGC 

CCCAGGTGGCCCGCTTCATTCGCCTGCTGCCCCAGACCTGGCTCCAGGGAGGCGCGCCTTG 

CCTCCGGGCAGAGATCCTGGCCTGCCCAGTCTCAGACCCCAATGACCTATTCCTTGAGGCC 

CCTGCGTCGGGATCCTCTGACCCTCTAGACTTTCAGCATCACAATTACAAGGCCATGAGGA 

AGCTGATGAAGCAGGTACAAGAGCAATGCCCCAACATCACCCGCATCTACAGCATTGGGA

AGAGCTACCAGGGCCTGAAGCTGTATGTGATGGAAATGTCGGACAAGCCTGGGGAGCAT 

GAGCTGGGGGAGCCTGAGGTGCGCTACGTGGCTGGCATGCATGGGAACGAGGCCCTGGG 

GCGGGAGTTGCTTCTGCTCCTGATGCAGTTCCTGTGCCATGAGTTCCTGCGAGGGAACCCA 

CGGGTGACCCGGCTGCTCTCTGAGATGCGCATTCACCTGCTGCCCTCCATGAACCCTGATG 

GCTATGAGATCGCCTACCACCGGGGTTCAGAGCTGGTGGGCTGGGCCGAGGGCCGCTGGA 

ACAACCAGAGCATCGATCTTAACCATAATTTTGCTGACCTCAACACACCACTGTGGGAAG 

CACAGGACGATGGGAAGGTGCCCCACATCGTCCCCAACCATCACCTGCCATTGCCCACTT 

ACTACACCCTGCCCAATGCCACCGTGGCTCCTGAAACGCGGGCAGTAATCAAGTGGATGA 

AGCGGATCCCCTTTGTGCTAAGTGCCAACCTCCACGGGGGTGAGCTCGTGGTGTCCTACCC 

ATTCGACATGACTCGCACCCCGTGGGCTGCCCGCGAGCTCACGCCCACACCAGATGATGC 

TGTGTTTCGCTGGCTCAGCACTGTCTATGCTGGCAGTAATCTGGCCATGCAGGACACCAGC 

CGCCGACCCTGCCACAGCCAGGACTTCTCCGTGCACGGCAACATCATCAACGGGGCTGAC 

TGGCACACGGTCCCCGGGAGCATGAATGACTTCAGCTACCTACACACCAACTGCTTTGAG 

GTCACTGTGGAGCTGTCCTGTGACAAGTTCCCTCACGAGAATGAATTGCCCCAGGAGTGG 

GAGAACAACAAAGACGCCCTCCTCACCTACCTGGAGCAGGTGCGCATGGGCATTGCAGGA 

GTGGTGAGGGACAAGGACACGGAGCTTGGGATTGCTGACGCTGTCATTGCCGTGGATGGG 

ATTAACCATGACGTGACCACGGCGTGGGGCGGGGATTATTGGCGTCTGCTGACCCCAGGG 

GACTACATGGTGACTGCCAGTGCCGAGGGCTACCATTCAGTGACACGGAACTGTCGGGTC 

ACCTTTGAAGAGGGCCCCTTCCCCTGCAATTTCGTGCTCACCAAGACTCCCAAACAGAGG 

CTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGTGCCCCCGGACCTTCGCAGGCGCCTGGAG 

CGGCTAAGGGGACAGAAGGATTGATACCTGCGGTTTAAGAGCCCTAGGGCAGGCTGGAC 

CTGTCAAGACGGGAAGGGGAAGAGTAGAGAGGGAGGGACAAA 
Figure 8



>CG54007-01 20190 nt 
ATGTGGGGGCTCCTGCTCGCCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCTCTGGGG 
GCGCCCAGGAACTCGGTGCTGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCAGGCTCG 
ACCCCGGCCCTGCATAGCAGCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGGACCTCA 
GAACAGCATGTCCGGATTCGAGTCATCAAGAAGAAAAAGGTCATTATGAAGAAGCGGAAG 
AAGCTAACTCTAACTCGCCCCACCCCACTGGTGACTGCCGGGCCCCTTGTGACCCCCACT 
CCAGCAGGGACCCTCGACCCCGCTGAGAAACAAGAAACAGGCTGTCCTCCTTTGGGTCTG 
GAGTCCCTGCGAGTTTCAGATAGCCGGCTTGAGGCATCCAGCAGCCAGTCCTTTGGTCTT 
GGACCACACCGAGGACGGCTCAACATTCAGTCAGGCCTGGAGGACGGCGATCTATATGAT 
GGAGCCTGGTGTGCTGAGGAGCAGGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCAC 
CCCACCCGCTTCTCGGGTGTTATCACACAGGGCAGGAACTCTGTCTGGAGGTATGACTGG 
GTCACATCATACAAGGTCCAGTTCAGCAATGACAGTCGGACCTGGTGGGGAAGTAGGAAC 
CACAGCAGTGGGATGGACGCAGTATTTCCTGCCAATTCAGACCCAGAAACTCCAGTGCTG 
AACCTCCTGCCGGAGCCCCAGGTGGCCCGCTTCATTCGCCTGCTGCCCCAGACCTGGCTC 
CAGGGAGGCGCGCCTTGCCTCCGGGCAGAGATCCTGGCCTGCCCAGTCTCAGACCCCAAT 
GACCTATTCCTTGAGGCCCCTGCGTCGGGATCCTCTGACCCTCTAGACTTTCAGCATCAC 
AATTACAAGGCCATGAGGAAGCTGATGAAGCAGGTACAAGAGCAATGCCCCAACATCACC 
CGCATCTACAGCATTGGGAAGAGCTACCAGGGCCTGAAGCTGTATGTGATGGAAATGTCG 
GACAAGCCTGGGGAGCATGAGCTGGGGGAGCCTGAGGTGCGCTACGTGGCTGGCATGCAT 
GGGAACGAGGCCCTGGGGCGGGAGTTGCTTCTGCTCCTGATGCAGTTCCTGTGCCATGAG 
TTCCTGCGAGGGAACCCACGGGTGACCCGGCTGCTCTCTGAGATGCGCATTCACCTGCTG 
CCCTCCATGAACCCTGATGGCTATGAGATCGCCTACCACCGGGGTTCAGAGCTGGTGGGC 
TGGGCCGAGGGCCGCTGGAACAACCAGAGCATCGATCTTAACCATAATTTTGCTGACCTC 
AACACACCACTGTGGGAAGCACAGGACGATGGGAAGGTGCCCCACATCGTCCCCAACCAT 
CACCTGCCATTGCCCACTTACTACACCCTGCCCAATGCCACCGTGGCTCCTGAAACGCGG 
GCAGTAATCAAGTGGATGAAGCGGATCCCCTTTGTGCTAAGTGCCAACCTCCACGGGGGT 
GAGCTCGTGGTGTCCTACCCATTCGACATGACTCGCACCCCGTGGGCTGCCCGCGAGCTC 
ACGCCCACACCAGATGATGCTGTGTTTCGCTGGCTCAGCACTGTCTATGCTGGCAGTAAT 
CTGGCCATGCAGGACACCAGCCGCCGACCCTGCCACAGCCAGGACTTCTCCGTGCACGGC 
AACATCATCAACGGGGCTGACTGGCACACGGTCCCCGGGAGCATGAATGACTTCAGCTAC 
CTACACACCAACTGCTTTGAGGTCACTGTGGAGCTGTCCTGTGACAAGTTCCCTCACGAG 
AATGAATTGCCCCAGGAGTGGGAGAACAACAAAGACGCCCTCCTCACCTACCTGGAGCAG 
GTGCGCATGGGCATTGCAGGAGTGGTGAGGGACAAGGACACGGAGCTTGGGATTGCTGAC 
GCTGTCATTGCCGTGGATGGGATTAACCATGACGTGACCACGGCGTGGGGCGGGGATTAT 
TGGCGTCTGCTGACCCCAGGGGACTACATGGTGACTGCCAGTGCCGAGGGCTACCATTCA 
GTGACACGGAACTGTCGGGTCACCTTTGAAGAGGGCCCCTTCCCCTGCAATTTCGTGCTC 
ACCAAGACTCCCAAACAGAGGCTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGTGCCCCCG 
GACCTTCGCAGGCGCCTGGAGCGGCTAAGGGGACAGAAGGATTGANNANTNCANNTTNAN 
NNTNGNNANNTCTCACTTATAAATGGAAGCTGGCGGGACACGGTGGCTCACTCCTGTAAT 
CCCAACACTTTGGGAGGCTGAGGCGGGTGGATCACGAGGTCAGGAGATCGAGACCATCCT 
GACTAACACGGTGAAACCCGTCTCTACTAAAAACACAAAAAATTAGCTGGGCGTGGTGGC 
GGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCATGAACCCAGGAG 
TCGGAGCTTGCAGTGAGCCGAGTTCACGCCACTGCATTCCAGCCTGGGCAACAGAGCGAG 
ACTCTGTCTCAAAAAAAATAAATTAAATAAAAATAAATAAATGGAAACTAAGCTGTGGGT 
ATGCAAAGGCATACAGAATGGTATAATGGACATTGGAGACTCAGAAGGAGGAGGGTAAGC 
GGGGGGTGACAGATAAAAAAAACTGCATGTTGCATACAATGTACACTACTCGGGTGATGG 
GCGCTCTAAGATTTCAAACTTCACCACTATACAGTTCTCCCCTGTAACCAAAAACCGCTG 
GTACCCCTAAAGCAATTGAAATAAAAATAGAAACTATGTTGTAGCCTGGATGACATAGCG 
AAAACTTGTCTCTTAAAAAAAAAAAAATGTGGCCGGGTGCAGTGGCTCACACCTGTAATC 



CCAGCACTTTGGGAGGCCCAAGGCGGGCAGATCACAAGGTCAGGAGATTGAGACCGTCCT 
GGCTAACAAGGTGAAACTCCATCTCTACTAAAAATACAAAAAATTAGCCGGGTGTGGTGG 
CACACGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGA 
GGCGGAGGTTGCAGTGAGCCGAGATCGCACCACAGCACTCCAGCCTGGTGACAGAGTGAG 
ATTTAGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGTAGAAATTAGCTGAGCGTG 
GTGACACGTCCCAGATACTTGGGAGGCTGAGGTGGGAGGATCGCTTGAACCCAGGAGTTC 
CAGACTGCAGTGAGCTGTGATTACACTATTGCACTCCAGCCTAGGCTGTGGGAAAGAGAG 
TTTCTGGGGTGCCAGCTGAGTTAGTCTTCCCTGTGTGAGACACCCATGGGAAGCCATGCG 
CGGCCTCTGAGGAGAAAAGTCTCCTTATTGCCTTCATGTCTTTACGCCCGAGAGCAGAAC 
CCCTCAGCGGCATTCCACAGGTTGCTCAGGCATATAACACTCCCTTGAAGCAGTGGAGTA 
TAATCAAACATCTTGGCTCCTCCTGAAACCCACTCCCACCCGTTTCAGTCCCGATAAGTT 
AAAGATTTGTTTTGTTTTGTTTTTGTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGG 
AGTGCGGTGGCTCGATCTCGGCTCACTGCAAGCTCCGCCTCCCGGGTTCACGCCATTCTC 
CCGCCTCAGCCTCCCGAGTAGCTGGGACGACAGGCGCCCGCCACCACGCCCGGCTAATTT 
TTTGTATTTTTAGTAGAGACGGGGTTTCACCACGTTGGCCAGAGTGGTCTCGAACTCCTG 
ACCTCAAGCGATCCACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCAC 
CGCGCCCGGCCAGTTAAAGATCTTAAGTAGTTTGACACTCCTCTTTGCTCAAGGAAATTC 
ACAGAAACCGCCACTGCTATACATCTTACAGAATGACTCTCCAGTTCTCCTTCACTGATT 
AATCCTTTCCCTCATCCCTTCCTCCTCCTCCCATCTGCCCTAAGAACAAAGAGCTTGTAA 
ACCAATAAATTGGGCGGAGCCTGAGAACTCTGGGCCGTGAGCAAGCCTCCGACGCTCCGG 
TCCCCTGGACCCGCCTTTTAAACGCTTATTCTGTCTCTTTCTAACTCCTTTGTCTCCGCC 
GGACTCGGGGTAACCGCTAGGCGTTATGGGGCTGTTTTCCCCAACATAGGCAACAGAGCA 
GGACAGTGTCTCTAAAAAAACAAAACCAAAACTATATTTTGTACTATTCTGATAAAAATG 
ACTTAGTTACAAACAAAGAACAAATCAACAGATAGTCATGCTGTGGAGATCAGGAATATT 
CCTTCCCAGGGTAAATGAAAGACCAATTCCCTAACGTCATGTGGATATACGCTTGTGGCT 
TAAGATAAAATTACCCGTGACAGCATCAAATACCAGGGATAAAACTCAGTCTTCAACACG 
CATATGTATCTCCTGGGGTTGAATCCTCTGGAGGTCTTGTTAAAAATGCAGATTCTGGTC 
AAGAGTTCGAGACCAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAACACAAAA 
ATTAGCTGGGTGTGGTGGTGGACGCCTGTAGTCCCAGCTACTCAGGAGACTGAGGCAGGA 
GAATTGCTTGAACCCGGGAGGTGGCAGTTTAGTGAGCTGAGATCGGGCCACTGCACTCCA 
GCCTGGGAGACAGAGTGAGACTCTGTCAAAAAAAAAAAAAAAAAAAAATGCATATTCTGA 
TTCAATAGGTCTGGGGCAGAGGTGTTTTTTTTGTTTGTTTGTTTTTTGTTTTTTGGTTTT 
TTTTTTGGTTTTTTTTTTTTGACAGAGTCTAGCTCTTTCACCTAGGCTGGAGTGCATGAC 
ACCATCCCAGCTCACTGCAACCTCCGCTTCTTGGGTTCAAGCGATTCTCCTGCCTCAGCC 
TCCTGAATAGCTGGGATTACAGGCGTGCACCACCACACCCAGCTAAGTTTTGTATTTGTA 
GTAGAGATGGGGTTTCACCGTGTTGGCCAGGTAAGTTTTGTATTTGTATTTGGTCTTGAA 
CTCCTGACCTCAGGTGATCCGCCCGCCTCGGCCTCCCAAAGTTCTGGGATTACAGGCGTG 
AGCCACTGCACCCGGCCTGTTCTGCATTTCTAACAAGTTCCCAGGGGATGCTACTGCTGC 
TGGTCTTCAACCACACTTTGTGGAGCAAGGCTCTCAAAGACCTTGATGTATGTAGGAGAG 
AAAGCTGGGGTAGAGAGTGATGAGGGGAGAACGGGTGCGTGGGGAGATGCTCCCCTGTGC 
ATCCTGGTCCCATGTGAGGCTCCAACAATGCTCACCTACATCACAGGGAGAGCACCTAGC 
AGGAAATGAGTTCTGCTTTAGCATCCAGGCACAGGAGATTAGAGGCACAGGCAGGCAGTA 
GATTCTACTTCATTATTTGTGCAGCTGGACACAGAGCTTCCTTTCTTTTCCTTGATACTG 
TTTTATTCCATCTAAGTATGTAGGAGTAAGAGGGCTGTGTTACACTGTTTTCCCCACCTT 
TAATGCATCTGATCAACCTAGGAGCCCCCTAAGACCCTATATTATCTCACTTTATCATCA 
CAGCAAACCTGGGAGAAGGATATGGTTCCTGTTTTACAGATGAGGAAACTAAGTCTCAGG 
GAGGTGAAACTACTGCCCAAGGATAGCCAAACAAAATACACGTCAGAAGTGGGATGTGAA 
ACGAAGCCTGTATGTCACCAGAGTCACCTATCCTCTCCCCCTCCAACCACCTAACCACAC 
CAGGGAGTTGGCAGGAGATTCCTAGCCCACCCCTTACATTAAAATCCCTTTTAGGCGGGT 
GCCACTATCCAGTCCTTCTCAATTGCACCTAGTGAGACCACGAAAGATCTTCTACCTGGC 
TCCTGGTAGATGAGATCTGGCTATACAGGTACTTGGGTGCAAACCTGCCCCTCTGCCCCT 
GGAGCTATCACCTCCAGATCCTGCTACTTGTACCTTTGCAGCCCCAGGTAGCCAGTGGCA 



AGGGCCAGGGGTGGCAGCAGGGCTGGGAGTGGAGAAGAGTGTGAGAAAGTGCTGCGGGGC 
TCAGGAGACACAGCAGGGAACCAAGGGGTCCTAAGGGTTGCAATAGAGGACAGGGGCAGG 
GAGTGCAGAGTGGTGGGAAGGGGGATGGGAGCTGGGTGCAGGAGACATAAGAGATGGAGC 
ATCCCGGCCACACACGGTGGCTCACACCTGTTATCCCAGCACTTTGGGAGGCCGAGGTGG 
GTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCT 
ACTAAAAACACAAAAAATTAGCCAGGCGAGGTGGTGTGCACCTGTAGTCCCAGCTTCTTG 
GGAGGCTGAGGCAGGAGAATGGCGTGAACCCAGGAGGTGGAGCTTGCAGTCAGCTGAGAT 
CCCGCCACTGCATTCCAGCCTGGGTGACAGAGTGAGACTCGTCTCAAAAAAAAAAAGAAT 
AAAAGAAAAAAGAGGTGGAGCATCCTGCAGCCCTGGCCCCTAAAAGATTGGTGGGAGAGT 
GCCAGCTGCTCCACCCTAGTCACTTTGGGAACTGGTCTTTCAGTTCACGGCCTGCCATGT 
CCTCTCCTGCAAATCCTGGCACTGTTGAGGAGGTCCTTTCAGCCCTGGTTTGTCCACTCT 
AACCTTGAATATATTATACACACACTTTATGAGAGCTGACGAGGGACCAGGTGCTGTTCT 
AGGCTCTGAGGTGCAGCTGTGGACATTTGGGTACAAAGTTCTTCTGGCAGGGTACTTACC 
TCCTGCTGGGGGTGGGGGAACCTGAACAGCCAACACATAAGTAAAGCAAGATCATCTCGG 
TGTTGAGTGCCTTGAAGACAATAATTTAAACGGGTGGGAGGATAGAGTGTGTGAAGTGAA 
AAAGTTTGCTTTAGTCAGGGTAGTCAGGGAAAGCCTTTGGGAGCAGGTGATATTGAAAGG 
AAATCTGACTGAGAAGGCAAATTCCATGCACAAATTAAAAGGCCAGGAGGCTAGTTGGGC 
TGTTGCGTGGGAGGAGCAGCTAGAATGCCGGAGTGACTGGGGGGATGGGAGCCAGGGGAT 
AGGGAGGCAGATGGAATGGGAAAGGCGTGGGCAGGAAGAACTTGGTCATGAAGACCTTGC 
AGGTGAACCCACTGGGGCCTTAAGCCTGGAGGAACTTGACAGAATTTGCCTACTGTGTGG 
GGAACGGCTTGGAGGGGGTGTGGGCTTCAGGAGGCTGAGATGTCCTGTTTCTTGTGCCCC 
CTCCTTTCTTCCCAACACCCGAGAAACCTGGATGGGTGTGGGGACCAGAGACCTGGAGGT 
GGCCAGATTGGGCTTTGGCGGGACGCTTAGCAGCCCTCGGGACCTGTTCAGACTGCGGCC 
TCCCACCTTCGGGAAGCATCGGCGCTGCCCATCTGCCCCTGCCTGGCGTCCAGGGAGTCC 
CGGCTGTGCAGCGCTTCCCTTGAAATGTCTCTCTGTCCTCCCATCCAGTGCCTGGGACCC 
GGCAGCGCCGTCGAGGCAGGGGGCTGCGAGGCGGGACCCAGTTGCACGTGGGCCCTGTGG 
GGTCACTCCCTTTCGGGGGTCCTCTAGCTCTTCACCCTGCGCGCGTGGGGCAGACCAGAT 
GCCTCGAGGAGCTCCAGGACCAGTGCCTATGGGGTAGTCCCTGCCGGCGGTGGGCCCCAG 
TCCCAGACTGCGGCGCGCTATTTCTTTCTGGGGTTCGTGTGAGCGTGGGCTGCCAGAATG 
GTGCCCACAAGCTGCTTTTGGGTGATTCAAATCATTTATACAGATAGTGCCCCTGCAAAA 
AACATTTGCGCAGGGCCCCGCTTACGCCAGAGGATTGCGGGCCACTTCTGGGCATCGCTC 
CTCGTGGGGATGGGAGCATCTCCCTGGAGAGCCCTTTGCAAAGGCCAAGCGCCGGCCAAA 
GGCACACCGCTGGACGCGTTTCCTTCCTTCTGGAGAGATGACCAGGAATGCAGGATCCAA 
AGGGGGTCTTGGAGGGAGGGCGGGAAGGGCATCTCCGGATCTGGGCAGACCCAGGGCTGC 
CGGCTCCCCGAGGAGAATACGGGCTGGGGGCGAGGAGCCGGAGGGCAGGTCAGGCAGTGC 
ATCAACCCTTGGCTCCTCCACCGCAGCCCCAGCCCGCAGGCTATCGCTCAGGCTTCTCTC 
TCCGGGTTATGTAACCCCGGGACGGGACGTGGCAGCCGGGTGAGTGAGCGAAGGAGTAGG 
GGAGGGAAGGGAAAGGAGAGGAGGGGCAGGGCCGGGCTTGGTGATGGTGGTGGTGGGAAG 
CGCCGCCGTGCCGCCTCTTCTTGGGCCCCTTGGGTTGTCTTTCTGGAGGATTCCGGGACC 
AGCCCTCTCCCCAGGCTCCGGGTCGCCCCCTAGCCCCCCGCCGCCTCATTTTCCCTTCAC 
TCTTTTCCCCCTTCTGTCCCACCCGCCCTGCCAGGGGGCCTCTGGCTCTGGATAGCTTTT 
CCTCTCCGGTTGTAGTTTCCTTCCCAAAGTTCTCAGCTTTGCTACCTCGCCCAAGTCATT 
AGCCGCTCTGAGCCTCAGTTTATCAGTTTGTAAAATGAAGTTTGATTGAGCGGCCACGTG 
TAAAACTCCTGGCATAGTGCATGGTACAAAGTAGATGTCTGCTGCAGGCTAAGGGCCTCG 
AGGGGCTAAGTGAAATGTTGTGTGCCAGGCTGGGTGTCAGAGCCCCGGGAGCCGCAGCCA 
CGAATGGTTGGCTCCCGGGTGGTAAAAGAATTTATCAACAACAGTATAGGTTTGAAAAGT 
TTTATTAGATGGAAAGAACTCCACAGCAGAGCGCAGCGGGATGCTTCGGCAAGAGAGGCC 
TGAGCTCACTTGCAGGGAACTGAAGGGTAATTTTGACCACATTAGTTTTGTAGGTCATAG 
TAAATGATTACATTTGTAGACATTTTGGCACCTTGATGACAGCAAAGGTTGCACAATGGG 
TTCCAACATGCGTGCATTCCGGAGATGTATAGAAATTCTAGGGAAAGAAGCCTGGTACCA 
GATGTGGCTTTAGATAATAGGAAAGTACCATTCTGAGTTCTTCAGATAAGGTGCTTTGCC 
TCCTGATGGTCTGCTTGATGGCCACCAGGTGATCCTTGCTCTCCTCATTTTCCCCCTGAT 



AAATATTTTGGGCAAATCTTTGACCCTTTGTATTTCTCCATGCTCATGTCTACTTGTCTG 
TTAGGATCCCAAGAAAGGGAAAATGGCACAGTGAAGAGGGGTGTCCAGTCTATCTGGCTA 
CTTCCTGCTGAAAAGGGGCATTGAAAGGATTCCTTTCTTGCTTTCTGTCATGAAGGGAAT 
GAAGGGTCATGATAAACTTGTTCATGGAGGGAAGACCAGATTCCATCAAGAGGCCCCATG 
AAAATAGAAGTTGCTGTTGCAGGCTGGTATTGGGATTGCATAGTCATCTGTAGGTGGAAT 
CATTGTAAGCTGGAAGATATAAGCATTAAAAGGCAGGAATTACCGGCATGCACCTCCATG 
CCCACAGATTTTTGTGTTTTTAGTAGAGACAGGTTCTCACCATGTTGGCCAGGCTGGTCT 
CCAACTCCTGACCTCAGGTGATCCGCCCGCCTCGCCTTGGTCTCCCAAAGTACTAGGATT 
ACAGGTGTGAGCAACCACACCTGGCCCCTGGGGTCTCAATTTGTGTATTTATGCATGGCC 
TCCACCAGTCTAGCTTGGAAAAGGGCAGGGCTTTCAGATAGTTTCATACATACAAAATTA 
TTATTTCTTTTTATTTTATTTTATTTGAGATGGAATTTCGCTCTTGTTGCCCAGGCTGGA 
GTGCAGTGGCGCAATCTCAGCTCACCACAACTTCCGCCTCCAAGGTTCAAACGATTCTCC 
TGCCTCAGCCTCTGGAGTAACTGGGATTACAGGCATGCACCACCATGCCCAGCTCATTTT 
GTATTTTTAGTAGAGATGGGGTTTCTCCGTGTTGGCTAGGCTGGTCTCAAACCTCAGGTG 
ACCCGCTCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCGCCCAGC 
TATTATTTCTTATAATTTAGAAAAATTAACAGGTTTTATTATATATTTTTCATTCCCTCC 
AACAGAGAAGTTACCATATGATCCTGTCTGCCCTTACCTCTGTTTGGGCCAGAATTGGTG 
GCCTGGTATTGCCAATAGGTTCTATGTTGGGGACAGCTTCTGCCCAGCTCTGTTATTAGG 
ACTGGGAGCATGAGCTTCATCTGCCCATGCTGAAGATCACACGTGTGATTTTTTGTGTGT 
GGGAACAGCAGGTAGTTAATACCACAAATACATCTTGCCAGGTTAAATCAAAGGCAACAG 
TTAAAGTCTGAAATTCTTGAATGAACTTAGAGGGATCCTGACTAAATGAACCCAACTTGG 
ATTGAATTTGCAAAAGATCAGACATGATCAGAAAAGGGACATGAACTTGGCTTGTTCCCA 
AATCTTCATTAGCCACCTTAGGGAGAGGCAAAATATTTTGGGGATTTTTCTGAGGACTCT 
GTACTAGTAGCATATGTGACTCCCCTGAGAGTATGTGAAGGGGAGAAAGTATTTGGGTAT 
GTGGGTGGGAGATTGACTAGGGAATGGAGCAGATGGAGAGGGTGTAGGTGAAGAGTGAGC 
AGGTTGAGGAGGATGTAATAGGCAAAAGGAAGGATCATCTAAGACATCAGAACCGGGAAG 
GGAGGACGTTCCTTGGAAGCATACATGACAATTTGTATGTAATTTTGGGTTTGGATTTGG 
GGATAAAGCAAAAAAGACCTGAACATATGGGACTTCTGAATCCTTTCCAAGGTTCCGGCA 
AAAAATCAGTTAAGTTGTAAAGTAGCATTGCAATCCCAAGTTTCATTAATTGGCCAAATT 
GATTGATTAGGGAGCTTGTATTGAACCCAAGCAATATTAGAAAAAAGGATATGCTTTTTA 
AACTCTTATTTATTTTTTATTTGTATTTTTTGAGACAGAGTCTTGCTGTGTCGCCCAGGC 
TGGAGTGCTGTGGCGCCATCTTGGCCCACTGCAACCTCCGCCCCCGGGGTACAAGTGATT 
CTCCTGCCTCAGCCTCCCTAGTAGCTGGGATTATATGTGCCCGCCACATATAATTAGCCC 
CCTGGCTGATTTTTTTTTTTTTTTTGTATTTTTAGTAGAGACAGGGTTTCGCCATGTTGG 
CCAGGCTGATCTCGAACTCCTGACCTCAGGTGATCCACTCGCCTCGGCCTCCCAAAGTGC 
TAGGATTACAGGTGTGAGTCACTGTGCCCGGCCAAGTTTTGCATTTTTAGTAGACTCCCG 
GTCTTTAACTCCGGACCTCAGGTGATCTGCCTGCCTTGGCCTCCCAAAGTGCTGGGGTTA 
CAGGCATAAGCCATTGTGCTCAGCCTTATATGCTTATTTTTAAGAGTTTGTGGGTCAAAA 
TGAGACCAATGGGACCATTTTTAAGGAGGCAATCCAAGGGCGAGTTGGATGGAACTGAAT 
TAATTGAACCGAAGTTGGGTTTAGACAAGGAACTACAAGATCCCTGAGGCATCCCTGTGT 
AGAATTGAGATCCACCGCTTCCAGGACAAGGCTTATGGAGTGTTAAAATGAAAGTGCCCT 
GCCACTCTGACAGGCAATAGCTCTTTTGTCTTGGCCTTGGGGTAATACCGGGGGATGGCG 
CTTGGCCAGAAACTGTCAGTTGCCAACGAGAACTCAAGCTGGTTCACTGGCAGTCCGAAA 
ACAGAAAAGAGCCCTGGCCAGTCCCTCACCCCTAAGGGCAAGGACAGCCAGGTATCCCTT 
CTCTAGGGCTTCAGGATCCCACAGAAGAGCTGCCTCCACCGGGACCGGCAGTTCCCCAAA 
GAGTAAAGAACCAGACCGTGGAAGGAAGCAGAGAGAAAAAGGAAGAGGGAAATCCCAGTG 
AAGTCCCCGTATGGGCCACCAAGATGCCAGGCGAGGTGTCAGAGCTCCGGAACCGGGAAG 
TGGTTGGCTCCCGGGTGGTAAAAGAACTTATCAACAACCGTGTAGGTCTGAAAAGGAAAG 
TTTTATTAGACGGAAAGGACGAGGCAGCAGAGCGCAGTAGGCGCTTCAGCAAGAGAGGAC 
TGAGCTCCCTGCGGGGAACTGCAGGGTAATTTGGACCACATTAGTCACTTAGGTCATGGT 
AAATGGTTACATTTGTCGATATTTTGGTGCCTTGATGTCAGCAAAGTTTGCACAATGGGT 
CTTAACGTGCACTCATTCCGGAAACGTACAGAAATTCTAGTTACTTATAAATTCTTGGGA 



CGGAAGCTTGGTACCAGATGTGGCTTTAGACAATAGGGAAGTGTCATTCTGAATTGCTCA 

GATAAGGGGCTTTGCCTCCTGTTGGTCGACTTGATGGCCACCAGGTGATCTCTGGTCTCT 

TCAGTGTGGCTTTGCAGACTATAAAGGCGCAGCGCGCCAACGAGGCGGGTTGGCCCCAGA 

CGGCGGAGAGGAAGGGCAGAGTCGGCGGTCCTGAGACTTGGGGCGGCCCCTTGGAGGTCA 

GCCCCGCTCGCTCCTCCCGGCCCTCTCCTCCTCTCCGAGGTCCGAGGCGGGCAGCGGGCT 

GTGGGCGGGCAGGAGGCTGCGGAGGGGCGGGGGGCAGGAAGGGGCGGGGGGCTCGGCGCA 

CTCGGCAGGAAGAGACCGACCCGCCACCCGCCGTAGCCCGCGCGCCCCTGGCACTCAATC 

CCCGCCATGTGGGGGCTCCTGCTCGCCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCT 

CTGGGGGCGCCCAGGAACTCGGTGCTGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCA 

GGCTCGACCCCGGCCCTGCATAGCAGCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGT 

GAGTTCCCCGACCGACGGTCCGCTCCCCCGCAAGCCGACTGCCCGGCTCTCCTGCCCCGT 

GGGGCGATCCCTCCCTAACACGCGGGCACACGCACACCCACGCACACTCACAGTCATGCA 

CACTCACCCCGCACGCACACTCGCACTCACGCGCACACACGCGCGCGCACTCACACACAT 

TCACACACGCGCACACTTGCACTCACACGCGCGCGCATTCACACGCATGCACACACACGC 

ACACTCACACGCGCGTGCGCGCACACACAGTGCACGCGCGCGCACACTCACACTCACAGT 

GCACACACACATATACACACTCACACTCCCTCAACTCCCTGCTGGGAGCAATGGCTGCTG 

ACTCGGCAGCCCCAGTTCCCTGCCAGACCTAGTCAGCAGTCCCAGGACAGGCGCCAGTGG 

GATGCTGCCTCTTCCAAGCCCCAAACCTTCCCTTTTCACCAAAGACAAAACAGGCCAGAA 

CTGGCAGGAGGGGAGACAGAGGGGCAGAAGCTCTCAAGGTGCAGAGCAAGACTGCGTAGG 

AGAGAGTTTGAAGGCGAGGGCTGGAGAGAAAGAACAAA&GGAAAGAAGGGAGAGCCCCTC 

GCTGAGGCTGCCGGGAGGATGGGGCAGAGCGGGAGAGGAAGGCAGCCCGACCTCCCAGCT 

TTCCAGATGTGGAATAGGAGAGGAGGAGCGCAAGCGGAGGGCACTCAGGGGCTTCTAGAG 

GAGGCAAGTGGAGGAGGGTCTTGAAGGGTGATGTCCCCGAGTCAGGGGAGTCTGGAGAGA 

GAGAGAGAGAGAGGGCTGCCAAGAAGGAAGCGGCGGGCAAAGGCACAGGGGCACCAGATG 

CGGAAATGGGCAGCCTGTTCTGGAGGCAGCTGTGGAGCTTCGATGGGTACCCCCAGCACC 

TGCCTGGGCAGAGCCTTGTGCTGAAGGGCCGGCGGGCAGGCCCAGCCCTGAAAGCCTCGA 

CACCCAGGCAGACATGGATTCCAGGACAGGCCATCTGAGCCCAGAGAGCAGACACAACAA 

TGGAAGCGGCACAGGGGTTTTGGGGCATGATGCTGAGTCTGGAGCTAAGAAAGCCTCCTT 

GGAAAGGCATCTGGGCTGAGATGCAAAGGAAGAATGGGAATTAGGTGAAAAAATCAGAGG 

CGAGGGGTAGCATTACAGGGGAGGGGATAGCTAGTGCAGAGGCCCGGAGGTAAAGTGCCA 

GACTCAGCTCTTTGGAGCAACCGAACAGTTTCTAGAGGCTGGGTGCAGCTCTCCATTGGA 

TTAGAGGTTCACAGGGGAGGCTGGCCAAGCATGTAGTTACATCAGGGAGGAGAAGGAGGA 

GCCAAGGAAGTGACTGGAGAGGCAGGTTGGGGTCAGATTGCAGGCCTTTGATGTCCTGTG 

AAGGCTGTTAGATCCTGGTGGTGTGGCCTGCTGTGGGCTCACATGTCTTCTTGGGCTGGC 

AGACCTTTCCATCCGGGGTTTCACCATTCTTCCTTTCCCCCATGCTGTGCCTCTCGGACC 

CCAAGGGACCTCAGAACAGCATGTCCGGATTCGAGTCATCAAGAAGAAAAAGGTCATTAT 

GAAGAAGCGGAAGAAGCTAACTCTAACTCGCCCCACCCCACTGGTGACTGCCGGGCCCCT 

TGTGACCCCCACTCCAGCAGGGACCCTCGACCCCGCTGAGAAACAAGAAACAGGTACTTC 

CTCTCCAGGGGCCCAGCCCAGACTTGCAGCCCCTGGGGCACTTTACCAGCACAGCTCTTG 

GCCTCATGGGCACCGGCACGCCCCTTGCTTGCCTAGCGCAGGAGCAACCTTAGGCTCAGC 

TTCCCACCTGCCCTGGCTACCCTCCCTCTGGTCCTGTCTCACTGTTCTATCCCCGCCCCA 

GGCTGTCCTCCTTTGGGTCTGGAGTCCCTGCGAGTTTCAGATAGCCGGCTTGAGGCATCC 

AGCAGCCAGTCCTTTGGTCTTGGACCACACCGAGGACGGCTCAACATTCAGGTCAGTAAT 

CCTGGCTCGGAGCCATGGTCTCAGGGTAGGGAAGGCAGCCCCTGGGAGCTTCTCTCCTGC 

CTCCTCTCTGTCCTGGCCTGCCCCACTCTGTCCAACTGGGCCTGACCACCATGTCCTGTG 

TCTGCAGTCAGGCCTGGAGGACGGCGATCTATATGATGGAGCCTGGTGTGCTGAGGAGCA 

GGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCACCCCACCCGCTTCTCGGGTGTTAT 

CACACAGGGCAGGAACTCTGTCTGGAGGTGAGGCAGACTAACCCTAGGTCAGGAGGTCAC 

AGAAGGACTGGGGTGGGAGTCCTGGGGGCACCGATGATCTCTCTCCACCTCTCCTGCCAG 

GTATGACTGGGTCACATCATACAAGGTCCAGTTCAGCAATGACAGTCGGACCTGGTGGGG 

AAGTAGGAACCACAGCAGTGGGATGGACGCAGTGAGTGGTCCCACTGTGGCTGGGGCCTC 

CATGCTGGGAGTTGGGCACCCAGTCCAGGCTAGGCTGAGGCTCCTCTGAGGACAAGGAAT 



AGACGCCAGCTTAGGCTTCCCAGGGGGGTGTGGCTTGTTGTCAAGAGGGTGGCACACGGC 

AGGCACCATTGGGAGCCAGCTGCTTTGGGACATGCCCACATCCTCCCCAGATAATGCCAC 

CACAGGGTGGGTGCTGCTTCACGGTACAGCTTCCTCCTGGCGTGCCCCTTCTGGCCCGGG 

GCCTCTGGTCCACATCACTTCTTGCCTTCTCGTGGTTCTGACTTCCGCATCTCATGGACC 

TCTTTTTACAGCAGGCTACAATGTGGAGTCCTGGCCAGCTCTAGGATTGGCTTCCCCCGA 

GTCATGTGGCCAAACTGGTCTAATGAACTGTGTCCAATCCAGAGAGCAAGGCTGCCTAGG 

GCTGCCCATTGGCAGGGGCTGTGGGCCGGGGTCTGTGTTTGATGCACAGTGCAAGTCTCT 

AGCTGAGCCCACTAGGGTGGGGAGACAGTAAGCTTGGAGGCCTGAGCTCCTTCCCTGGGT 

CCTGGGCCAGGCTTCTGGGGTTTGAGCAGCCACAACAGAGAACTTGCTGCCCCCAGGTAT 

TTCCTGCCAATTCAGACCCAGAAACTCCAGTGCTGAACCTCCTGCCGGAGCCCCAGGTGG 

CCCGCTTCATTCGCCTGCTGCCCCAGACCTGGCTCCAGGGAGGCGCGCCTTGCCTCCGGG 

CAGAGATCCTGGCCTGCCCAGTCTCAGGTGGGCAGTCAGGCCAGGGTTGGTTGGGCAGGG 

CTTGGATGCAGGGTGCATCCTTCACTGTGGACACACCCTTTACCATAAACTCAACCTCCA 

CCAGACCCCAATGACCTATTCCTTGAGGCCCCTGCGTCGGGATCCTCTGACCCTCTAGAC 

TTTCAGCATCACAATTACAAGGCCATGAGGAAGGTCAGATATAACCCCTATGACCTGGGA 

AGGAGGGCCCACCCATCTCAGGTCCCCTTCCCACCTTCCCACCGGGGCACAACCTGCTGT 

GACTGCGCTTGTATGCCCCTGCTGCCTCCTGATGTCTCAGCCTTCTCTCCTGTGGACCCC 

TAAGCTCCATCCCACTTTCCCTTATTATGGCGCCCCCCCAGTCCTACCCCTTCCTCCCGG 

CTCTGCTGCCGCTCCCCTCCTGTACCATGATGGGATGCCCCCTCTGTGTGGGCCATCGCT 

GACTTTTTAAGTCTTTCCATGGCACATGTGATCTGCCCCTGGGTGTACCCCTCCCATGCC 

TCATGCCACGCTACACTCTGCCCACCAGCTGATGAAGCAGGTACAAGAGCAATGCCCCAA 

CATCACCCGCATCTACAGCATTGGGAAGAGCTACCAGGGCCTGAAGCTGTATGTGATGGA 

AATGTCGGACAAGCCTGGGGAGCATGAGCTGGGTACTGGCATGGGGAGTGGGGAGAGGTA 

GGCACAGGGCAGGGCCCCAGGCATGAACCCGCTGCAAGCCCCCATGTGTCCCCAGGGGAG 

CCTGAGGTGCGCTACGTGGCTGGCATGCATGGGAACGAGGCCCTGGGGCGGGAGTTGCTT 

CTGCTCCTGATGCAGTTCCTGTGCCATGAGTTCCTGCGAGGGAACCCACGGGTGACCCGG 

CTGCTCTCTGAGATGCGCATTCACCTGCTGCCCTCCATGAACCCTGATGGCTATGAGATC 

GCCTACCACCGGGTAGGCCACCCAGCATGAGGGCCACTCTGTCCTTCTGCCCTGGTGGCT 

GGACCTGCTCGACTTGAACAAGCCTCTTGCCCGGCAGGGTTCAGAGCTGGTGGGCTGGGC 

CGAGGGCCGCTGGAACAACCAGAGCATCGATCTTAACCATAATTTTGCTGACCTCAACAC 

ACCACTGTGGGAAGCACAGGACGATGGGAAGGTGCCCCACATCGTCCCCAACCATCACCT 

GCCATTGCCCACTTACTACACCCTGCCCAATGCCACCGTGAGTATTTTGAGGGCGGCAGT 

GGAGGTCTGTGGGGGGCGGACCTTGTCTCTGTCTCCTGCCCCTCCTGACCTGCCCCATCC 

AGGTGGCTCCTGAAACGCGGGCAGTAATCAAGTGGATGAAGCGGATCCCCTTTGTGCTAA 

GTGCCAACCTCCACGGGGGTGAGCTCGTGGTGTCCTACCCATTCGACATGACTCGCACCC 

CGTGGGCTGCCCGCGAGCTCACGCCCACACCAGATGATGCTGTGTTTCGCTGGCTCAGCA 

CTGTCTATGCTGGCAGTAATCTGGCCATGCAGGACACCAGCCGCCGACCCTGCCACAGCC 

AGGACTTCTCCGTGCACGGCAACATCATCAACGGGGCTGACTGGCACACGGTCCCCGGGA 

GTATGTGCCTGAGGGTGGAGTTAGCCCTGGCCCCGTAACCCCCGCCCTGATAAGACAGCC 

TGCGGTTGCGTACAGTGCTGGCGTCTGTTCCCACTCTGAAGTGTCCCTCAGAGAAGGGAG 

GGTAGCGGGAGGATGGGACCGCATCCCGCCTGCTTAGGCAGCAGTGTCTGTGGTCCCCTT 

AGGCATGAATGACTTCAGCTACCTACACACCAACTGCTTTGAGGTCACTGTGGAGCTGTC 

CTGTGACAAGTTCCCTCACGAGAATGAATTGCCCCAGGAGTGGGAGAACAACAAAGACGC 

CCTCCTCACCTACCTGGAGCAGGTCGGATCTGCGTCCCGGCCCCCAGCCTGCCTGAATCA 

CTCCTGCTGTCCATTTAGGCTACAGCTCCTACCAGGGGTTCTTCTAAGGTCCAGCTGAGC 

ATTCAGACTCACAAGATGCCATGGGCCATGCTTGGTATCAGATTGTCTTGGAAGCACACA 

GGACAGGAAGTGCAGTTTGCTGGCAGCGTGGCATCGTGTTAGAGCCGGTGGGAGGAGCCT 

CCATTGCAGTCTAGGTGGTGGTCCGTGGCGCTGCCCCAGAGCTATCCTCAGGAGAGACTC 

ACGTGAGGCAGGTGCAGGAGCTGTCCTGGCATAGAAGCTTCATGTTCCATGGAGCTCATA 

ACCCTTGTAATAGCTCCATAAGCAGAGCTTCCAAAGGGTCTACCAAAGACAAGCCCAATA 

ACCTGGGAAAGCCCAAGGATAGATAAGCCTTCCTACCAGGTATTTATCATTTTCTTAGTC 

CAGATGTGATTTGTCAATCAGGATTTCTTTTTTTTTTTTCTTCCAGAAGTAGTGTCACCT 



AGGAACACAGTAGACCTACCACTTTGCTCAGGTTTGCAGGGCAACAGAGCCAGCAAGTTA 
GCTAAACAGCACATTATCCTGCCGAAGGGGAAGGGCTCTGATAACCTCTTCCCACACAGG 
TGCGCATGGGCATTGCAGGAGTGGTGAGGGACAAGGACACGGAGCTTGGGATTGCTGACG 
CTGTCATTGCCGTGGATGGGATTAACCATGACGTGACCACGGGTGTGTTTGACCGGGAGG 
GCAAGGGAAGGGGCTGGAGGGCTGGAGGCTCGGGAAGAAGCAGAAGATCATTAATTGGGT 
CCTGATCGTGCCCTTCACTCTCCTCAGCGTGGGGCGGGGATTATTGGCGTCTGCTGACCC 
CAGGGGACTACATGGTGACTGCCAGTGCCGAGGGCTACCATTCAGTGACACGGAACTGTC 
GGGTCACCTTTGAAGAGGGCCCCTTCCCCTGCAATTTCGTGCTCACCAAGACTCCCAAAC 
AGAGGCTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGTGCCCCCGGACCTTCGCAGGCGCC 
TGGAGCGGCTAAGGGGACAGAAGGATTGATACCTGCGGTTTAAGAGCCCTAGGGCAGGCT 
GGACCTGTCAAGACGGGAAGGGGAAGAGTAGAGAGGGAGGGACAAAGTGAGGAAAAGGTG 
CTCATTAAAGCTACCGGGCACCTTAGCTCATCTTCGTGTTGTCTCTGTGCCCCAGGTCCT 
CCCCCCGGGGGCGGGCCTCGGCCCAGCCCTCAGTTCCTATTCTGCACACTTGCACACTCT 
CATCAGTTGGCTTCTGGACACATTGTGTGAAAAGAGGATCCCACCTGGGCTCTTCTTGAA 
CCAAGGGCCTGGCAGAGCAACTCATTTCTTCTGATCAGCTTCTGCTACAGGTACCATTAC 
ACTGCTGCCAGGCATTCTGTAAGCGCCTGCTCATTGCCAGGTGTGCAAGGAATCAGGATC 
AGCCGTGCCTGCACTCAAACTCCTGGGGCTCCTAGTCAAGGGAAAGGACAGTTCGGTACA 
TTGTGAGACATGCTAGGGTGGAGGCCAGGTGCCGTGAGAGTGCAGGGGAGCTGCACACGT 
GAAATACAGCACTGCACATCAACAGGACTGGGGCAGTCAAGGATGCAATAGAAGTAGTGG 
CTCTAGAAGTTCAGGCGGGAGGTGGGCAGGGTGTGGAGTATGGACAGGGATGGCTCCAAG 
GAGGAGGGTCAGCCAAAGGTGGGTCAGCTGAGAACATTTGAATTTGCTTCAGCCATTCTC 
AGAGTATTGATAACTGATAGGCTTTGCTGAGTTTCTATCAGACTGAAGGGGAAGTTGTGT 
ATCAGTCTGTGTCTTGCCAGGTAAACAACCCATTCTAGGCACTTAAAGTGGAGGGAAATT 
TAATGCTGGAAATTGGATAGGAAGGTGTTGGAAGAGCTGGATGAGGCCGGGTGTGGTGGC 
TCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGAGGATTGCTTGAGCCCAGGA 
GTTTGAGACCAGCCTGGATAACATAGCCAAACCCCGCCTCTACAAAAATAAGAAATAAGA 
AACATAGCCAGCTGTAGTGGCGCATGGCTAAGGGAGGCAGAGGCAGGAGGATCACTGGAG 
CCTGGGAGGTGGAGGCTGCAGAGGCAGCAGTGAGCCATGATGGCGCCACTATACTCCAAC 
CTGGATGGTCATAACAAAATAAACAAAAAA (SEQ ID NO: 3)



FIG. 9
>CG54007-04 

ATGTGGGGGCTCCTGCTCGCCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCTCTGGGG 60 

GCGCCCAGGAACTCGGTGCTGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCAGGCTCG 120 

ACCCCGGCCCTGCATAGCAGCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGGACCTCA 180 

GAACAGCATGTCCGGATTCGAGTCATCAAGAAGAAAAAGGTCATTATGAAGAAGCGGAAG 240 

AAGCTAACTCTAACTCGCCCCACCCCACTGGTGACTGCCGGGCCCCTTGTGACCCCCACT 300

CCAGCAGGGACCCTCGACCCCGCTGAGAAACAAGAAACAGGCTGTCCTCCTTTGGGTCTG 360 

GAGTCCCTGCGAGTTTCAGATAGCCGGCTTGAGGCATCCAGCAGCCAGTCCTTTGGTCTT 420 

GGACCACACCGAGGACGGCTCAACATTCAGTCAGGCCTGGAGGACGGCGATCTATATGAT 480 

GGAGCCTGGTGTGCTGAGGAGCAGGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCAC 540 

CCCACCCGCTTCTCGGGTGTTATCACACAGGGCAGGAACTCTGTCTGGAGGTATGACTGG 600 

GTCACATCATACAAGGTCCAGTTCAGCAATGACAGTCGGACCTGGTGGGGAAGTAGGAAC 660 

CACAGCAGTGGGATGGACGCAGTATTTCCTGCCAATTCAGACCCAGAAACTCCAGTGCTG 720

AACCTCCTGCCGGAGCCCCAGGTGGCCCGCTTCATTCGCCTGCTGCCCCAGACCTGGCTC 780 

CAGGGAGGCGCGCCTTGCCTCCGGGCAGAGATCCTGGCCTGCCCAGTCTCAGACCCCAAT 840 

GACCTATTCCTTGAGGCCCCTGCGTCGGGATCCTCTGACCCTCTAGACTTTCAGCATCAC 900 

AATTACAAGGCCATGAGGAAGCTGATGAAGCAGGTACAAGAGCAATGCCCCAACATCACC 960 

CGCATCTACAGCATTGGGAAGAGCTACCAGGGCCTGAAGCTGTATGTGATGGAAATGTCG 1020 

GACAAGCCTGGGGAGCATGAGCTGGGGGAGCCTGAGGTGCGCTACGTGGCTGGCATGCAT 1080

GGGAACGAGGCCCTGGGGCGGGAGTTGCTTCTGCTCCTGATGCAGTTCCTGTGCCATGAG 1140 

TTCCTGCGAGGGAACCCACGGGTGACCCGGCTGCTCTCTGAGATGCGCATTCACCTGCTG 1200 

CCCTCCATGAACCCTGATGGCTATGAGATCGCCTACCACCGGGGTTCAGAGCTGGTGGGC 1260 

TGGGCCGAGGGCCGCTGGAACAACCAGAGCATCGATCTTAACCATAATTTTGCTGACCTC 1320 

AACACACCACTGTGGGAAGCACAGGACGATGGGAAGGTGCCCCACATCGTCCCCAACCAT 1380 

CACCTGCCATTGCCCACTTACTACACCCTGCCCAATGCCACCGTGGCTCCTGAAACGCGG 1440 

GCAGTAATCAAGTGGATGAAGCGGATCCCCTTTGTGCTAAGTGCCAACCTCCACGGGGGT 1500 

GAGCTCGTGGTGTCCTACCCATTCGACATGGTGACTGCCAGTGCCGAGGGCTACCATTCA 1560 

GTGACACGGAACTGTCGGGTCACCTTTGAAGAGGGCCCCTTCCCCTGCAATTTCGTGCTC 1620 

ACCAAGACTCCCAAACAGAGGCTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGTGCCCCCG 1680 
GACCTTCGCAGGCGCCTGGAGCGGCTAAGGGGACAGAAGGATTGA (SEQ ID NO: 5) 1725 

FIG. 10 



>CG54007-04 

MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120

ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180

PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 240

NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 300 

NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 360

GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 420 

WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR 480

AVIKWMKRIPFVLSANLHGGELVVSYPFDMVTASAEGYHSVTRNCRVTFEEGPFPCNFVL 540
TKTPKQRLRELLAAGAKVPPDLRRRLERLRGQKD (SEQ ID NO: 6) 574 



FIG. 11 



>CG54007-05 

ATGTGGGGGCTCCTGCTCGCCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCTCTGGGG 60 

GCGCCCAGGAACTCGGTGCTGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCAGGCTCG 120 

ACCCCGGCCCTGCATAGCAGCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGGACCTCA 180 

GAACAGCATGTCCGGATTCGAGTCATCAAGAAGAAAAAGGTCATTATGAAGAAGCGGAAG 240 

AAGCTAACTCTAACTCGCCCCACCCCACTGGTGACTGCCGGGCCCCTTGTGACCCCCACT 300 

CCAGCAGGGACCCTCGACCCCGCTGAGAAACAAGAAACAGGCTGTCCTCCTTTGGGTCTG 360 

GAGTCCCTGCGAGTTTCAGATAGCCGGCTTGAGGCATCCAGCAGCCAGTCCTTTGGTCTT 420 

GGACCACACCGAGGACGGCTCAACATTCAGTCAGGCCTGGAGGACGGCGATCTATATGAT 480 

GGAGCCTGGTGTGCTGAGGAGCAGGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCAC 540 

CCCACCCGCTTCTCGGGTGTTATCACACAGGGCAGAGATCCTGGCCTGCCCAGTCTCAGA 600 

CCCCAATGACCTATTCCTTGAGGCCCCTGCGTCGGGATCCTCTGACCCTCTAGACTTTCA 660

GCATCACAATTACAAGGCCATGAGGAAGCTGATGAAGCAGGTACAAGAGCAATGCCCCAA 720 

CATCACCCGCATCTACAGCATTGGGAAGAGCTACCAGGGCCTGAAGCTGTATGTGATGGA 780 

AATGTCGGACAAGCCTGGGGAGCATGAGCTGGGGGAGCCTGAGGTGCGCTACGTGGCTGG 840 

CATGCATGGGAACGAGGCCCTGGGGCGGGAGTTGCTTCTGCTCCTGATGCAGTTCCTGTG 900 

CCATGAGTTCCTGCGAGGGAACCCACGGGTGACCCGGCTGCTCTCTGAGATGCGCATTCA 960 

CCTGCTGCCCTCCATGAACCCTGATGGCTATGAGATCGCCTACCACCGGGGTTCAGAGCT 1020 

GGTGGGCTGGGCCGAGGGCCGCTGGAACAACCAGAGCATCGATCTTAACCATAATTTTGC 1080

TGACCTCAACACACCACTGTGGGAAGCACAGGACGATGGGAAGGTGCCCCACATCGTCCC 1140 

CAACCATCACCTGCCATTGCCCACTTACTACACCCTGCCCAATGCCACCGTGGCTCCTGA 1200 

AACGCGGGCAGTAATCAAGTGGATGAAGCGGATCCCCTTTGTGCTAAGTGCCAACCTCCA 1260 

CGGGGGTGAGCTCGTGGTGTCCTACCCATTCGACATGACTCGCACCCCGTGGGCTGCCCG 1320 

CGAGCTCACGCCCACACCAGATGATGCTGTGTTTCGCTGGCTCAGCACTGTCTATGCTGG 1380 

CAGTAATCTGGCCATGCAGGACACCAGCCGCCGACCCTGCCACAGCCAGGACTTCTCCGT 1440 

GCACGGCAACATCATCAACGGGGCTGACTGGCACACGGTCCCCGGGAGCATGAATGACTT 1500

CAGCTACCTACACACCAACTGCTTTGAGGTCACTGTGGAGCTGTCCTGTGACAAGTTCCC 1560 

TCACGAGAATGAATTGCCCCAGGAGTGGGAGAACAACAAAGACGCCCTCCTCACCTACCT 1620 

GGAGCAGGTGCGCATGGGCATTGCAGGAGTGGTGAGGGACAAGGACACGGAGCTTGGGAT 1680 

TGCTGACGCTGTCATTGCCGTGGATGGGATTAACCATGACGTGACCACGGCGTGGGGCGG 1740 

GGATTATTGGCGTCTGCTGACCCCAGGGGACTACATGGTGACTGCCAGTGCCGAGGGCTA 1800 

CCATTCAGTGACACGGAACTGTCGGGTCACCTTTGAAGAGGGCCCCTTCCCCTGCAATTT 1860

CGTGCTCACCAAGACTCCCAAACAGAGGCTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGT 1920 

GCCCCCGGACCTTCGCAGGCGCCTGGAGCGGCTAAGGGGACAGAAGGATTGA (SEQ ID NO: 7) 1972 



FIG. 12 
>CG54007-05 

MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120

ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180 
PTRFSGVITQGRDPGLPSLRPQ (SEQ ID NO: 8) 202 



FIG. 13 



>CG54007-06 

ATGTGGGGGCTCCTGCTCGCCCTGGCCGCCTTCGCGCCGGCCGTCGGCCCGGCTCTGGGG 60 

GCGCCCAGGAACTCGGTGCTGGGCCTCGCGCAGCCCGGGACCACCAAGGTCCCAGGCTCG 120 

ACCCCGGCCCTGCATAGCAGCCCGGCACAGCCGCCGGCGGAGACAGCTAACGGGACCTCA 180 

GAACAGCATGTCCGGATTCGTGTCATCAAGAAGAAAAAGGTCATTATGAAGAAGCGGAAG 240

AAGCTAACTCTAACTCGCCCCACCCCACTGGTGACTGCCGGGCCCCTTGTGACCCCCACT 300 

CCAGCAGGGACCCTCGACCCCGCTGAGAAACAAGAAACAGGCTGTCCTCCTTTGGGTCTG 360 

GAGTCCCTGCGAGTTTCAGATAGCCGGCTTGAGGCATCCAGCAGCCAGTCCTTTGGTCTT 420 

GGACCACACCGAGGACGGCTCAACATTCAGTCAGGCCTGGAGGACGGCGATCTATATGAT 480 

GGAGCCTGGTGTGCTGAGGAGCAGGACGCCGATCCATGGTTTCAGGTGGACGCTGGGCAC 540 

CCCACCCGCTTCTCGGGTGTTATCACACAGGGCAGGAACTCTGTCTGGAGGTATGACTGG 600 

GTCACATCATACAAGGTCCAGTTCAGCAATGACAGTCGGACCTGGTGGGGAAGTAGGAAC 660 

CACAGCAGTGGGATGGACGCAGTATTTCCTGCCAATTCAGACCCAGAAACTCCAGTGCTG 720 

AACCTCCTGCCGGAGCCCCAGGTGGCCCGCTTCATTCGCCTGCTGCCCCAGACCTGGCTC 780 

CAGGGAGGCGCGCCTTGCCTCCGGGCAGAGATCCTGGCCTGCCCAGTCTCAGACCCCAAT 840 

GACCTATTCCTTGAGGCCCCTGCGTCGGGATCCTCTGACCCTCTAGACTTTCAGCATCAC 900 

AATTACAAGGCCATGAGGAAGCTGATGAAGCAGGTACAAGAGCAATGCCCCAACATCACC 960 

CGCATCTACAGCATTGGGAAGAGCTACCAGGGCCTGAAGCTGTATGTGATGGAAATGTCG 1020 

GACAAGCCTGGGGAGCATGAGCTGGGGGAGCCTGAGGTGCGCTACGTGGCTGGCATGCAT 1080 

GGGAACGAGGCCCTGGGGCGGGAGTTGCTTCTGCTCCTGATGCAGTTCCTGTGCCATGAG 1140 

TTCCTGCGAGGGAACCCACGGGTGACCCGGCTGCTCTCTGAGATGCGCATTCACCTGCTG 1200

CCCTCCATGAACCCTGATGGCTATGAGATCGCCTACCACCGGGGTTCAGAGCTGGTGGGC 1260 

TGGGCCGAGGGCCGCTGGAACAACCAGAGCATCGATCTTAACCATAATTTTGCTGACCTC 1320 

AACACACCACTGTGGGAAGCACAGGACGATGGGAAGGTGCCCCACATCGTCCCCAACCAT 1380 

CACCTGCCATTGCCCACTTACTACACCCTGCCCAATGCCACCGTGGCTCCTGAAACGCGG 1440 

GCAGTAATCAAGTGGATGAAGCGGATCCCCTTTGTGCTAAGTGCCAACCTCCACGGGGGT 1500 

GAGCTCGTGGTGTCCTACCCATTCGACATGACTCGCACCCCGTGGGCTGCCCGCGAGCTC 1560 

ACGCCCACACCAGATGATGCTGTGTTTCGCTGGCTCAGCACTGTCTATGCTGGCAGTAAT 1620 

CTGGCCATGCAGGACACCAGCCGCCGACCCTGCCACAGCCAGGACTTCTCCGTGCACGGC 1680

AACATCATCAACGGGGCTGACTGGCACACGGTCCCCGGGAGCATGAATGACTTCAGCTAC 1740 

CTACACACCAACTGCTTTGAGGTCACTGTGGAGCTGTCCTGTGACAAGTTCCCTCACGAG 1800 

AATGAATTGCCCCAGGAGTGGGAGAACAACAAAGACGCCCTCCTCACCTACCTGGAGCAG 1860 

GTGCGCATGGGCATTGCAGGAGTGGTGAGGGACAAGGACACGGAGCTTGGGATTGCTGAC 1920 

GCTGTCATTGCCGTGGATGGGATTAACCATGACGTGACCACGGCGTGGGGCGGGGATTAT 1980 

TGGCGTCTGCTGACCCCAGGGGACTACATGGTGACTGCCAGTGCCGAGGGCTACCATTCA 2040 

GTGACACGGAACTGTCGGGTCACCTTTGAAGAGGGCCCCTTCCCCTGCAATTTCGTGCTC 2100 

ACCAAGACTCCCAAACAGAGGCTGCGCGAGCTGCTGGCAGCTGGGGCCAAGGTGCCCCCG 2160 

GACCTTCGCAGGCGCCTGGAGCGGCTAAGGGGACAGAAGGATTGA (SEQ ID NO: 4) 2205 



Figure 14. 



>ptnr:SPTREMBL-ACC:Q9NUB5 DJ860F19.3 (NOVEL PROTEIN (ORTHOLOG OF MOUSE

METALLOCARBOXYPEPTIDASE CPX-1)) - Homo sapiens (Human), 734 aa.
Length =734 

Score = 2735 (962.8 bits), Expect = 0.0, Sum P(2) = 0.0
Identities = 510/510 (100%), Positives = 510/510 (100%)

Query: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 
Sbjct: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60 

Query: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120 

EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL
Sbjct: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120

Query: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180 

ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 
Sbjct: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180 

Query: 181 PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 240 

PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 
Sbjct: 181 PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 240

Query: 241 NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 300 

NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 
Sbjct: 241 NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 300 

Query: 301 NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 360 

NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 
Sbjct: 301 NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 360 

Query: 361 GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 420 

GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 
Sbjct: 361 GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 420 

Query: 421 WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR 480 

WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR
Sbjct: 421 WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR 480

Query: 481 AVIKWMKRIPFVLSANLHGGELVVSYPFDM 510

AVIKWMKRIPFVLSANLHGGELVVSYPFDM
Sbjct: 481 AVIKWMKRIPFVLSANLHGGELVVSYPFDM 510

Score = 341 (120.0 bits), Expect = 0.0, Sum P(2) = 0.0
Identities = 67/69 (97%) , Positives = 67/69 (97%) 

Query: 507 PFD-MVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPPDLRRR 565 

P D MVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPPDLRRR 
Sbjct: 666 PGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPPDLRRR 725 

Query: 566 LERLRGQKD 574 (SEQ ID NO: 6) 

LERLRGQKD (SEQ ID NO: 41) 

Sbjct: 726 LERLRGQKD 734 (SEQ ID NO:42) 
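
The "Identities" figure reported for the second segment pair can be reproduced by counting matching columns. The sketch below is illustrative only: the function name `count_identities` is hypothetical, and the two strings are the Query (507 to 574) and Sbjct (666 to 734) rows copied from the alignment above, with the gap kept as "-".

```python
def count_identities(query, sbjct):
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical only when both rows carry the same residue.
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, len(query)


# Rows copied from the second segment pair of Figure 14.
query = ("PFD-MVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPPDLRRR"
         "LERLRGQKD")
sbjct = ("PGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPPDLRRR"
         "LERLRGQKD")

matches, length = count_identities(query, sbjct)
print(f"Identities = {matches}/{length} ({100 * matches // length}%)")
```

The two non-identical columns are the F/G substitution and the single gap opposite Y, which gives the 67/69 (97%) shown in the figure.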



Figure 15 

>ptnr:SPTREMBL-ACC:Q9NUB5 DJ860F19.3 (NOVEL PROTEIN (ORTHOLOG OF MOUSE

METALLOCARBOXYPEPTIDASE CPX-1)) - Homo sapiens (Human), 734 aa.
Length = 734



Score = 1005 (353.8 bits), Expect = 4.4e-101, P = 4.4e-101 
Identities = 192/193 (99%) , Positives = 193/193 (100%) 



Query: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS
Sbjct: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

Query: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120

EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL
Sbjct: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120

Query: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180

ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH
Sbjct: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180

Query: 181 PTRFSGVITQGRD 193 (SEQ ID NO: 8)

PTRFSGVITQGR+ (SEQ ID NO: 43)
Sbjct: 181 PTRFSGVITQGRN 193 (SEQ ID NO: 44)





Figure 16 

>ptnr:SPTREMBL-ACC:Q9NUB5 DJ860F19.3 (NOVEL PROTEIN (ORTHOLOG OF MOUSE

METALLOCARBOXYPEPTIDASE CPX-1)) - Homo sapiens (Human), 734 aa.
Length = 734

Score = 3952 (1391.2 bits), Expect = 0.0, P = 0.0
Identities = 734/734 (100%), Positives = 734/734 (100%) 

Query: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60

MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS
Sbjct: 1 MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS 60 

Query: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120 

EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 
Sbjct: 61 EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL 120 

Query: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180 

ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 
Sbjct: 121 ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH 180 

Query: 181 PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 240 

PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 
Sbjct: 181 PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL 240 

Query: 241 NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 300 

NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 
Sbjct: 241 NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH 300 

Query: 301 NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 360 

NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 
Sbjct: 301 NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH 360

Query: 361 GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 420 

GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 
Sbjct: 361 GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG 420 

Query: 421 WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR 480 

WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR
Sbjct: 421 WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR 480

Query: 481 AVIKWMKRIPFVLSANLHGGELVVSYPFDMTRTPWAARELTPTPDDAVFRWLSTVYAGSN 540

AVIKWMKRIPFVLSANLHGGELVVSYPFDMTRTPWAARELTPTPDDAVFRWLSTVYAGSN
Sbjct: 481 AVIKWMKRIPFVLSANLHGGELVVSYPFDMTRTPWAARELTPTPDDAVFRWLSTVYAGSN 540 

Query: 541 LAMQDTSRRPCHSQDFSVHGNIINGADWHTVPGSMNDFSYLHTNCFEVTVELSCDKFPHE 600

LAMQDTSRRPCHSQDFSVHGNIINGADWHTVPGSMNDFSYLHTNCFEVTVELSCDKFPHE
Sbjct: 541 LAMQDTSRRPCHSQDFSVHGNIINGADWHTVPGSMNDFSYLHTNCFEVTVELSCDKFPHE 600

Query: 601 NELPQEWENNKDALLTYLEQVRMGIAGVVRDKDTELGIADAVIAVDGINHDVTTAWGGDY 660 

NELPQEWENNKDALLTYLEQVRMGIAGVVRDKDTELGIADAVIAVDGINHDVTTAWGGDY
Sbjct: 601 NELPQEWENNKDALLTYLEQVRMGIAGVVRDKDTELGIADAVIAVDGINHDVTTAWGGDY 660

Query: 661 WRLLTPGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPP 720

WRLLTPGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPP
Sbjct: 661 WRLLTPGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPP 720 

Query: 721 DLRRRLERLRGQKD 734 (SEQ ID NO: 2)

DLRRRLERLRGQKD (SEQ ID NO: 45)
Sbjct: 721 DLRRRLERLRGQKD 734 (SEQ ID NO: 46) 



kDa: 116, 6

KKLTLTRPTPLVTAGPL
KKLTLTRPPPLV-AGPL



Figure 18 ClustalW alignment of CG54007-01, CG54007-04 and CG54007-05 proteins of the 
present invention. 



CG54007-05  MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS
CG54007-01  MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS
CG54007-04  MWGLLLALAAFAPAVGPALGAPRNSVLGLAQPGTTKVPGSTPALHSSPAQPPAETANGTS

CG54007-05  EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL
CG54007-01  EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL
CG54007-04  EQHVRIRVIKKKKVIMKKRKKLTLTRPTPLVTAGPLVTPTPAGTLDPAEKQETGCPPLGL

CG54007-05  ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH
CG54007-01  ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH
CG54007-04  ESLRVSDSRLEASSSQSFGLGPHRGRLNIQSGLEDGDLYDGAWCAEEQDADPWFQVDAGH

CG54007-05  PTRFSGVITQGR [remainder illegible]
CG54007-01  PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL
CG54007-04  PTRFSGVITQGRNSVWRYDWVTSYKVQFSNDSRTWWGSRNHSSGMDAVFPANSDPETPVL

CG54007-01  NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH
CG54007-04  NLLPEPQVARFIRLLPQTWLQGGAPCLRAEILACPVSDPNDLFLEAPASGSSDPLDFQHH

CG54007-01  NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH
CG54007-04  NYKAMRKLMKQVQEQCPNITRIYSIGKSYQGLKLYVMEMSDKPGEHELGEPEVRYVAGMH

CG54007-01  GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG
CG54007-04  GNEALGRELLLLLMQFLCHEFLRGNPRVTRLLSEMRIHLLPSMNPDGYEIAYHRGSELVG

CG54007-01  WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR
CG54007-04  WAEGRWNNQSIDLNHNFADLNTPLWEAQDDGKVPHIVPNHHLPLPTYYTLPNATVAPETR

CG54007-01  AVIKWMKRIPFVLSANLHGGELVVSYPFDMTRTPWAARELTPTPDDAVFRWLSTVYAGSN
CG54007-04  AVIKWMKRIPFVLSANLHGGELVVSYPFDM

CG54007-01  LAMQDTSRRPCHSQDFSVHGNIINGADWHTVPGSMNDFSYLHTNCFEVTVELSCDKFPHE

CG54007-01  NELPQEWENNKDALLTYLEQVRMGIAGVVRDKDTELGIADAVIAVDGINHDVTTAWGGDY

CG54007-01  WRLLTPGDYMVTASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPP
CG54007-04             TASAEGYHSVTRNCRVTFEEGPFPCNFVLTKTPKQRLRELLAAGAKVPP

CG54007-01  DLRRRLERLRGQKD
CG54007-04  DLRRRLERLRGQKD



Figure 19



BrdU Incorporation
Proliferation of NIH 3T3 Cells